Are you aware of Federated engine in MySQL (apart from MyISAM and InnoDB)?
This engine allows you to define a table that sucks data from another table, even from a remore server. The tables definition must be the same.
I use it for the following:
Every time I rebuild the project, I have wait for 15 minutes while two big tables are created and filled with data — these are geo data tables (world cities, regions, etc), 4 mln records, and POI table, 2 mln records. I use Federated tables to create two separate databases and just link these tables in my project.
These tables are shared between several environments (dev, test and live) on the same server.
To check if your MySQL server has the Federated engine supported, you can use just a phpMyAdmin — go to home page of you phpMyAdmin installation (click Home picture), then choose Engines tab and check there.
If it’s not enabled (gray), open your my.ini file, find the “[mysqld]” part and make it to look like this:
[mysqld]
federated
P.S. If you have an error in the table definition, phpMyAdmin shows your database as empty. To fix this, log in via mysql console and try to make a SELECT from this poorly defined table and you get the error message to work with.
Now let’s create a SQL function for handy converts. Create a udf.sql file and add this in it:
DELIMITER //
DROP FUNCTION IF EXISTS EXCHANGE;
CREATE FUNCTION EXCHANGE( amount DOUBLE, cFrom CHAR(3), cTo CHAR(3) ) RETURNS DOUBLE READS SQL DATA DETERMINISTIC
COMMENT 'converts money amount from one currency to another'
BEGIN
DECLARE rateFrom DOUBLE DEFAULT 0;
DECLARE rateTo DOUBLE DEFAULT 0;
SELECT `rate` INTO rateFrom FROM `currency` WHERE `code` = cFrom;
SELECT `rate` INTO rateTo FROM `currency` WHERE `code` = cTo;
IF ISNULL( rateFrom ) OR ISNULL( rateTo ) THEN
RETURN NULL;
END IF;
RETURN amount * rateTo / rateFrom;
END; //
DELIMITER ;
and run this command in your shell:
mysql --user=USER --password=PASS DATABASE < udf.sql
This is how you can use this function — how to convert 10 US dollars to Canadian dollars:
SELECT EXCHANGE( 10, 'USD', 'CAD')
which results in $10 = 10.93 Canadian dollars.
P.S. Consider adding the currency export action call to your cron scripts.
P.P.S. A function to unzip the data file can be got at
A problem comes on stage when a Flash file uploader is added to your project – usually it cannot “login” to your site, i.e. users are not able to use the Flash file uploader behind beta login.
That’s how I solved it.
It’s not the web server who must solve this (Apache), it’s the application server (PHP). So remove the lines above from .htaccess and use for this purpose — it’s Zend’s HTTP Authentication Adapter.
What concerns the Flash uploader: it sends ‘Shockwave Flash’ as value of ‘User-Agent’ request header. So in your Initializer or Bootstrap file (where you load Zend_Auth_Adapter_Http) check this header value, and if it’s not Flash’s, go for HTTP authentication.
P.S. Hackers can assume this and fake the header to access your site. To cope with that, use an additional secret request variable (Flash uploaders allow this) and check it at server side.
Reverse Geocoding is process opposite to Geocoding (when you get map coordinates by city/country given). So the idea is to get city name by coordinates on the map.
Why you may need it? For example, user is supposed to add a marker on the map, and you want to check that the marker is within a particular country or region.
Solutions:
(JavaScript) can work with coordinates to retrieve location name. Advantage is that return information is translated to the preferred language of your browser. Here is you can play with.
offers several services, among which you can find Reverse Geocoding web service (REST or JSON) which can work with coordinates and postal codes. Advantage — you can download city/region/country names and POI with coordinates.
Data feeds often come in XML format, so your application must be able to deal with that format.
As I already wrote, data in CSV format (comma separated values) can be loaded to database extremely fast. So my idea was to convert XML data files to CSV and then use bulk load to database. My tests shown that this is faster in 10-100 times than one by one inserts.
Yesterday I decided to write a generalized solution for this, and it turned out that there is no need: it’s just coming — MySQL 6 will have such feature!
How it works: you create a table, name its columns exactly as XML nodes/attributes names or — and MySQL server will load it correspondently.
Example — you downloaded a POI list file (Points of Interest) called poi.xml that looks like this:
CREATE TABLE IF NOT EXISTS `poi` (
`lat` varchar(255) NOT NULL,
`lon` varchar(255) NOT NULL,
`name` varchar(255) NOT NULL
) DEFAULT CHARSET=utf8;
OK, now you load the XML data to your table:
LOAD XML INFILE '\\path\\to\\poi.txt'
INTO TABLE `poi`
ROWS IDENTIFIED BY '<wpt>'
Voila!
The good thing is that MySQL 6 is already available in alpha version — good enough for development purposes; I gave it a try — it takes 5 seconds to load 4.8 Mb of data in 19 files.
You have a web application hosted on a server and you want to set file access permissions by chmod command, different for files and folders.
Why different? A folder must have ‘x’ (access) flag:
drwxr-xr-x
At the same time a file shouldn’t have it (otherwise it would be executable):
rw-r--r--
How can you set it up for all nested files/folders in your application folder?
You can solve this task by setting common persmissions for all files and folders (step 1) and then make folders accessable (step 2), recursively:
chmod -R 644 .
chmod -R a+X .
But chmod allows you to set up only access flag for folders, you cannot set one persmission for folders and another for files.
Here is a script written by my collegue Anton [vojtoshik] Voitovych to solve this task.
Copy it and save to a file (I saved it in /bin/rchmod — thus it’s accessable for all users of my system):
#!/bin/bash
files="0644";
directories="0755";
path=$1;
shift;
while getopts "d:f:" OPTION
do
case $OPTION in
f) files="$OPTARG" ;;
d) directories="$OPTARG" ;;
esac
done
if [ "$path" != '' ]; then
find "$path" -type d ! -name "." -exec chmod "$directories" {} \;
find "$path" -type f -exec chmod "$files" {} \;
else
echo "Usage: rchmod <path_to_directory> [-d <directories mode>] [-f <files mode>]";
echo -ne \\n;
fi
OK, now make it executable:
chmod 755 rchmod
How to use it.
1. You are in your application folder and want to set 755 (drwxr-xr-x) mode for folders and 644 (rw-r--r--) mode for files which is the default:
rchmod .
2. You want to set 123 permissions only for folders in a particular folder::
rchmod /some/path/to/folder -d 123
3. You want to set 123 permissions for folders and 456 permission for files in a particular folder::
rchmod /some/path/to/folder -d 123 -f 456
Note: don’t forget that every time after that command run you should make some folders writable (wp-content/uploads, cache, temp — what else you have in your webapp).
After authentication in my project it takes about 10-20 minutes for the auth session to expire, which is not handy — you go to get a snack and see a login screen coming back.
This is how to make the TTL of your auth session longer:
There are situations when you need to separate processing of big amount of data between several “agents”, e.g.:
you have a long list of websites which must be checked for being alive (404 error check) by your web-clawlers;
a queue of photos to be resized or videos to be converted;
articles that your editors must review;
catalogue of blog feeds that your system must import posts from;
etc.
The idea to do this is simple:
Give a small piece of big work to an agent.
Mark this piece as given to him (so that none other starts to do the same job) and remember the time stamp when the job was given or when the job becomes obsolete (this agent is dead, let’s give this job to someone else).
If work is done — go to step #1.
After some period of time (1 hour) check all the time stamps, and if some agents didn’t cope with the job, mark the jobs as free so that others could start to work on it.
The problem is between steps #1 and #2 — while you gave a job to Agent 1 and going to mark it as given to him, what if Agent 2 is given by the same job? If you have many Agents, this can happen at real. This situations is called concurrent read/write.
To overcome this a lock can be used.
In this article I wil explain, how to use locks in Zend project with MySQL database.
First of all, MySQL documentation tells that SELECT .. FOR UPDATE can be used for that purpose. First step is to select records by that statement, and second step is to mark them as locked. Requirements are to use InnoDB storage and to frame these two statements in a transaction.
Happily, Zend_Db_Table_Select has a special method forUpdate() that implements SELECT .. FOR UPDATE statement. Zend_Db can cope with transactions as well. Let’s try it!
To lock a record, we need two fields:
one to remember ID of agent that is processing this record (let’s call this column ‘locked_by‘)
one another to know the time when the lock becomes obsolete (let’s call this column ‘expires_at‘)
I wrote a class that inherits from Zend_Db_Table and helps to get records with locking them.
If the table has a composite primary key (containing more than one column), the ActiveRecord approach is used, so the save() method for every record is called, that’s simple (drawback — multiple update queries). Otherwise, if it is a deep-seated table with one ID column as a primary key, then the IDs are collected in a list and all records are updated by a single statement with IN in where clause (which is much faster).
TTL (‘Time to Live‘) — period of time when lock is allowed. In my application the default is one hour. Format of TTL can be seen in .
And now how to use it.
Let’s imagine you have several editors that divide the big articles list and review them. My model class has a method fetchForUser() that returns no more than 5 articles for current user (by given user ID).
This is an Article table model, inherited from the class above. Usually such classes are located at
Note: if the editor refreses the page, the expres_at fields is refreshed by current time as well.
As for step four of our algorithm (releasing all obsolete locks) — create an action in your backend controller, call your table model releaseLocks() method in it and call that action periodically by Cron.
To boost the performance of the lock releasing, create an index on the expires_at column. (Because of this reason I rejected the ‘locked_since‘ column in favor of ‘expires_at‘)
P.S. In my database date/time columns have DATETIME type. If you use INT to store timestamps, convert it to unix time and back.
I have a couple of other WordPress blogs on the same server besides this one. One day I realised that all of them have 3 different WP versions and, as result, different admin areas which is not handy. I decided to make them use the same WordPress installation.
Ok, first of all, I deleted wp-admin and wp-includes folders and created new ones as symbolic links. Though the frontend worked well, I couldn’t log in into admin area, because the browser was redirected to that blog which was the base for all the rest for unknown reason.
The investigation shown, that admin area of WordPress is a separate sub-application, thing in itself, and in my case it resolves the absolute path to its source as the path to the base blog. I wanted each blog to use its own folder because there are config file and uploads folder.
It took me some time to find a solution. It requires two steps.
First, I added this line to the top of the .htaccess to make any PHP request to the blog (the blog front-end and the admin area scripts) call the same script before thier start:
#fix for several sites on the same WP installation
php_value auto_prepend_file "/var/www/site_doc_root/prepend.php"
In this code /var/www/ is the root folder for all my sites, and the site_doc_root is the document root of the current site (folder where all its files are located).
OK, the 2nd step — the contents of the prepend.php script. It is easy — it just must define an absolute path constant which is used all around the WordPress:
<?php
define('ABSPATH', dirname(__FILE__).'/');
OK, after that I decided not to use one of the blogs as source for others, but download a fresh copy of WordPress and make it a source of the symbolic links for all my blogs. This helps to update them.
Then I deleted wp-admin and wp-includes folders and some wp-files and recreated them as symlinks. Attention to wp-config.php — don’t delete it, keep it unique for every site!
To make this task easier, I created setup.sh file, pasted the contents I show below, run this command
chmod 755 setup.sh
then I copied it in every site folder and launched there for every site:
This will download a fresh copy of the wordpress if it’s changed (though wordpress team doesn’t show the file Last-Modified header, I think one day they will) and unzip it to /var/www/wordpress/ folder which is the source for our symlinks.
Yes, you got it right — launching this script is all I need to update all my blogs.
Let’s make it periodic:
crontab -e
and then add this line to run the update process automatically every 1st day of every month at 9 AM:
Recent Comments